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Reply to Non-Final Office Action of April 2K 2004 

Amendments to the Specification 

Please replace the Abstract with the replacement abstract included as an appendix. 
Applicant makes the following amendments to the identifying numerals of Figure 1 in the 
corresponding text. No new matter has been added. 

Please replace paragraphs [0026] - [0034] with the following amended 

paragraphs: 

[0026] FIG. 1 illustrates an exemplary hardware and software environment that could be used to 
implement the database system described below. In the exemplary environment, a computer 
system 100 is comprised of one or more processing units (PUs) 102j^, also known as processors 
or nodes, which are interconnected by a network 104. Each of the PUs 102jusi_W3-is coupled to 
zero or more fixed and/or removable data storage units (DSUs) 106jjsi, such as disk drives, that 
store one or more relational databases. Further, each of the PUs 102 i^ 403-is coupled to zero or 
more data communications units (DCUs) 1 08, such as network interfaces, that communicate with 
one or more remote systems or devices. 

[0027] Operators of the computer system 100 typically use a -one of the workstations 1 IOi. a^ a 
terminal, a_computer, or another input device to interact with the computer system 100. This 
interaction generally comprises queries that conform to the Structured Query Language (SQL) 
standard, and invoke functions performed by a Relational DataBase Management System 
(RDBMS) executed by the system 100, 

[0028] In one example, the RDBMS comprises the Teradata® product offered by NCR 
Corporation, the assignee of the present invention, and includes one or more Parallel Database 
Extensions (PDEs) 1 12| .n^ Parsing Engines (PEs) 114j^, and Access Module Processors 
(AMPs) 11 6i^. These components of the RDBMS perform the functions necessary to 
implement the RDBMS and SQL functions, i.e., definition, compilation, interpretation, 
optimization, database access control, database retrieval, and database update. 
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[0029] Generally, the PDEs 112i_Nj4 42. PEs 114i^, and AMPs 116i_r 446-are tangibly 
embodied in and/or accessible from a device, media, carrier, or signal, such as RAM, ROM, one 
or more of the DSUs 106, and/or a remote system or device communicating with the computer 
system 100 via one or more of the DCUs 108. The PDEs 112ii_n443, PEs 114i^ -I44, and AMPs 
1 1 6j^ 444-each comprise logic and/or data which, when executed, invoked, and/or interpreted 
by the PUs 102j_Ni 4^3-of the computer system 100, cause the necessary steps or elements 
described below to be performed. 

[0030] Those skilled in the art will recognize that the exemplary environment illustrated in 
FIG. 1 is not intended to limit the present invention. Indeed, those skilled in the art will 
recognize that other alternative environments may be used without departing from the scope of 
the present invention. In addition, it should be understood that the present invention may also 
apply to components other than those disclosed herein. 

[0031] In an example system, work is divided among the PUs 102j_n] W3rin the system 100 by 
spreading the storage of a partitioned relational database 1 1 8 managed by the RDBMS across 
multiple AMPs 116 and the DSUs i06jjvi_+©6-( which are managed by the AMPs 116^:446). 
Thus, one of the a DSUs 106| .m may store only a subset of rows that comprise a table in the 
partitioned database 1 18 and work is managed by the system 100 so that the task of operating on 
each subset of rows is performed by the AMPs 1 1 6j^ 446-managing the DSUs 106jjvj[ 4-Q6-that 
store the subset of rows. 

[0032] The PDEs 1 12jjg 443-provide a high speed, low latency, message-passing layer for use in 
communicating between the PEs 114jj_444-and AMPs 1 1 6j_r446. Further, each of the PDEs 
1 1 2 i_M 442-is an application programming interface (API) that allows the RDBMS to operate 
under either the UNIX MP-RAS or WINDOWS NT operating systems, in that each of the PDEs 

1 12 ] N-.4-^2-isolates most of the operating system dependent functions from the RDBMS, and 
performs many operations such as shared memory management, message passing, and process or 
thread creation. 

[0033] The PEs 1 14 j_R 4-4-4-handle communications, session control, optimization and query plan 
generation and control, while the AMPs 116j_r 446-handle actual database 1 18|..m table 

HOU03:976527.I - Page 4 of 20- Atty: Docket No. 1 0239 

Express Mail Label: EV449865944US 



I 



Appl. No. 10/038,783 

Reply to Non-Final Office Action of April 21, 2004 

manipulation. The PEs 1 14 fully parallelize all functions among the AMPs 116 j_f444. Both the 
PEs 1 14ji_R 4 44-and AMPs 1 16|..c .ti^are known as "virtual processors" or "vprocs". 

[0034] The vproc concept is accomplished by executing multiple threads or processes in a PU 
102, wherein each thread or process is encapsulated within a vproc. The vproc concept adds a 
level of abstraction between the multi-threading of a work unit and the physical layout of the 
parallel processing computer system 100. Moreover, when one of thea PUs \02]^ itself is 
comprised of a plurality of processors or nodes, the vproc concept provides for intra-node as well 
as the inter-node parallelism. 

Please replace paragraphs [0036] - [0037] with the following amended 

paragraphs: 

[0036] The system 100 does face the issue of how to divide a query or other unit of work into 
smaller sub-units, each of which can be assigned to one of the an AMPs 1 16 i x^46. In one 
example, data partitioning and repartitioning may be performed, in order to enhance parallel 
processing across multiple AMPs 1 16iMr 446. For example, the database 1 1 8 may be hash 
partitioned, range partitioned, or not partitioned at all (i.e., locally processed). 

[0037] Hash partitioning is a partitioning scheme in which a predefined hash function and map is 
used to assign records to AMPs 1 16 j!j4-l-6. wherein the hashing function generates a hash 
"bucket" number and the hash bucket numbers are mapped to AMPs 116j^-446. Range 
partitioning is a partitioning scheme in which each of the AMPs 1 1 6i_r 446-manages the records 
falling within a range of values, wherein the entire data set is divided into as many ranges as 
there are AMPs 1 16i..cr H-6. No partitioning means that a single one of the AMP ll6\ cJ 4S 
manages all of the records. 

Please replace paragraphs [0039] - [0044] with the following amended 

paragraphs: 

[0039] Block 200 represents SQL statements being accepted by one of the PEs 114^^444. 
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[0040] Block 202 represents the SQL statements being transformed by a Compiler or Interpreter 
subsystem of one of the PEs 114|„b _11 4 into an execution plan. Moreover, an Optimizer 
subsystem of one of the PEs 1 14j _R 1 1 4 may transform or optimize the execution plan in a 
manner described in more detail later in this specification. 

[0041] Block 204 represents one of the PEs 1 14j^ 444-generating one or more "step messages" 
from the execution plan, wherein each step message is assigned to one of the an AMPs 1 16 that 
manages the desired records. As mentioned above, the rows of the tables in the database 118 
may be partitioned or otherwise distributed among multiple AMPs 1 16i„c j44, so that multiple 
AMPs il6jL£_446-can work at the same time on the data of a given table. If a request is for data 
in a single row, one of the PEs 1 14j_R transmits the steps to one of the AMPs 1 16iMr 446-in 
which the data resides. If the request is for multiple rows, then the steps are forwarded to all 
participating AMPs 116jx. Since the tables in the database 1 18i..m rnay be partitioned or 
distributed across the DSUs 106i..m of the AMPs 1 16 1 16j _r —the workload of performing the 
SQL query can be balanced among AMPs ii6i£_++6-and DSUs 106i..m» 

[0042] Block 204 also represents one of the PEs 1 Hjjr 444'Sending the step messages to their 
assigned AMPs 1 16| .(4 46* 

[0043] Block 206 represents one of the AMPs 1 16| .c performing the required data manipulation 
associated with the step messages received from one of the PEs 1 1 4j _R'144. and then transmitting 
appropriate responses back to one of the PEs 1 1 4j_Rj 44. 

[0044] Block 208 represents one of the PEs 1 14j_R 444-merging the responses that come from 
the AMPsil6ix446. 

Please replace paragraph [0048] with the following amended paragraph: 

[0048] In this example, the tables 300 and 305 are joined according to equivalence relations 
indicated in the query. It is the job of the Optimizer subsystem of one of the PEs 1 14|„b» at step 
202 of FIG. 2, to select a least costly join plan. 



HOU03:976527.I 



- Page 6 of 20- 



Atty: Docket No. 10239 
Express Mail Label: EV449865944US 



